Hmm Composition-based Rapid Model Ad Gmm Adaptation Evaluation O
نویسنده
چکیده
When a speech recognition system is used in a real environment, its recognition performance is affected by the surrounding noise. Most types of additional noise as well as SNRs are difficult to predict, so there is a mismatch between the training and test data. We need a method to deal with this problem. In this paper, we propose an HMM compositionbased model adaptation method with a priori noise GMM adaptation against the mismatch between different types of noise in noisy data. We also prepare multiple HMMs for several SNRs and select the one that can most effectively, based on the acoustic likelihood, deal with unknown SNRs. We carried out speech recognition experiments in noisy environments by using an AURORA2 task test set B. The results show 53% improvement in word accuracy from the baseline system with one-second real noise data used for adaptation. The performance is equivalent that of conventional HMM composition methods using ten-second real data.
منابع مشابه
Speaker adaptation of context dependent deep neural networks based on MAP-adaptation and GMM-derived feature processing
In this paper we propose a novel speaker adaptation method for a context-dependent deep neural network HMM (CD-DNNHMM) acoustic model. The approach is based on using GMMderived features as the input to the DNN. The described technique of processing features for DNNs makes it possible to use GMM-HMM adaptation algorithms in the neural network framework. Adaptation to a new speaker can be simply ...
متن کاملA scalable approach to using DNN-derived features in GMM-HMM based acoustic modeling for LVCSR
We present a new scalable approach to using deep neural network (DNN) derived features in Gaussian mixture density hidden Markov model (GMM-HMM) based acoustic modeling for large vocabulary continuous speech recognition (LVCSR). The DNN-based feature extractor is trained from a subset of training data to mitigate the scalability issue of DNN training, while GMM-HMMs are trained by using state-o...
متن کاملGMM-derived features for effective unsupervised adaptation of deep neural network acoustic models
In this paper we investigate GMM-derived features recently introduced for adaptation of context-dependent deep neural network HMM (CD-DNN-HMM) acoustic models. We improve the previously proposed adaptation algorithm by applying the concept of speaker adaptive training (SAT) to DNNs built on GMM-derived features and by using fMLLR-adapted features for training an auxiliary GMM model. Traditional...
متن کاملRapid unsupervised speaker adaptation based on multi-template HMM sufficient statistics in noisy environments
This paper describes a multi-template unsupervised speaker adaptation based on HMM-Sufficient Statistics. Multiple class-dependent models based on gender and age are used to push up the adaptation performance while keeping adaptation time within few seconds with just one arbitrary utterance. Adaptation begins with the estimation of speaker‘s class from the N-best neighbor speakers using Gaussia...
متن کاملSpeaker state recognition using an HMM-based feature extraction method
In this article we present an efficient approach to modeling the acoustic features for the tasks of recognizing various paralinguistic henomena. Instead of the standard scheme of adapting the Universal Background Model (UBM), represented by the Gaussian ixture Model (GMM), normally used to model the frame-level acoustic features, we propose to represent the UBM by building monophone-based Hidde...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002